A Unique, Consistent Identifier for Alternatively Spliced Transcript Variants
نویسندگان
چکیده
BACKGROUND As research into alternative splicing reveals the fundamental importance of this phenomenon in the genome expression of higher organisms, there is an increasing need for a standardized, consistent and unique identifier for alternatively spliced isoforms. Such an identifier would be useful to eliminate ambiguities in references to gene isoforms, and would allow for the reliable comparison of isoforms from different sources (e.g., known genes vs. computational predictions). Commonly used identifiers for gene transcripts prove to be unsuitable for this purpose. METHODOLOGY We propose an algorithm to compute an isoform signature based on the arrangement of exons and introns in a primary transcript. The isoform signature uniquely identifies a transcript structure, and can therefore be used as a key in databases of alternatively spliced isoforms, or to compare alternative splicing predictions produced by different methods. In this paper we present the algorithm to generate isoform signatures, we provide some examples of its application, and we describe a web-based resource to generate isoform signatures and use them in database searches. CONCLUSIONS Isoform signatures are simple, so that they can be easily generated and included in publications and databases, but flexible enough to unambiguously represent all possible isoform structures, including information about coding sequence position and variable transcription start and end sites. We believe that the adoption of isoform signatures can help establish a consistent, unambiguous nomenclature for alternative splicing isoforms. The system described in this paper is freely available at http://genome.ufl.edu/genesig/, and supplementary materials can be found at http://genome.ufl.edu/genesig-files/.
منابع مشابه
Increased Expression of Two Alternative Spliced Variants of CD1d Molecule in Human Gastric Cancer
Background: CD1d presents glycolipid antigens to invariant natural killer T (iNKT) cells. The role of CD1d in the development of peptic ulcer and gastric cancer has not been revealed, yet. Objective: To clarify the expression of alternatively spliced variants of CD1d in peptic ulcer and gastric cancer. Methods: Patients with dyspepsia were selected and divided into three groups of non-ulcer dys...
متن کاملBacterial Expression and Functional Characterization of A Naturally Occurring Exon6-less Preprochymosin cDNA
Chymosin (Rennin EC 3.4.23.4), an aspartyl proteinase, is the major proteolytic enzyme in the fourthstomach of the unweaned calf, and it is formed by proteolytic activation of its zymogene, prochymosin.Following the cloning of synthesized cDNAs on mRNA pools extracted from the mucosa of the calf fourthstomach, we have identified an alternatively spliced form of preprochymosin ...
متن کاملASDB: database of alternatively spliced genes
A database of alternatively spliced genes (ASDB) has been constructed based on (i) the results of the analysis of Swiss-Prot entries containing products of these genes and (ii) clustering procedure joining proteins that could arise by alternative splicing of the same gene. ASDB incorporates information about alternatively spliced genes, their products and expression patterns. It can be searched...
متن کاملThe distribution pattern of genetic variation in the transcript isoforms of the alternatively spliced protein-coding genes in the human genome.
By enabling the transcription of multiple isoforms from the same gene locus, alternative-splicing mechanisms greatly expand the diversity of the human transcriptome and proteome. Currently, the alternatively spliced transcripts from each protein-coding gene locus in the human genome can be classified as either principal or non-principal isoforms, providing that they differ with respect to cross...
متن کاملExtensive coupling of alternative splicing of pre-mRNAs of serine/arginine (SR) genes with nonsense-mediated decay.
In Arabidopsis, pre-mRNAs encoding serine/arginine (SR) proteins, key regulators of constitutive and alternative splicing, are extensively alternatively spliced. In seedlings, 13 SR genes are alternatively spliced to generate 75 transcripts, of which 53 contain a premature termination codon (PTC). However, it is not known if any of the PTC-containing splice variants are the targets of nonsense-...
متن کامل